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ABSTRACT 



A model for predicting retention of regular Marine Corps officers 
is formulated. Using data from the three cohorts 1956 - 58, a statisti- 
cal test is used to show that the lifetimes on active duty of members of 
each cohort follow the same distribution. Unknown parameters for this 
distribution are estimated from the data. Retention figures for various 
lengths of service of the cohorts 1959 - 63 are calculated with the 
model and compared with actual data. Confidence intervals for the 
predictions are given. Use of the model and follow-on studies are 
suggested and examples given. 
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I. SUMMARY 



A, INTRODUCTION 

The need for predicting and controlling the number of officers on 
active duty in the Marine Corps is a very real and continuing problem. 
Since overall officer strength is established by law, personnel planners 
must determine the number of officers to be introduced into the system 
so that future needs will be met without exceeding prescribed limits. 

Marine Corps officers come on active duty with either a regular 
or reserve commission. Officers with regular commissions have a speci- 
fied obligated period of service, after which they may either resign 
their commission or remain on active duty as they so desire. On the 
other hand, reserve officers have three alternatives. During their in- 
itial obligated period of service they may be given the opportunity to 
apply for a regular commission; if they do so and are accepted they can 
then stay on active duty of resign (after their obligated time) as they 
so desire. Their other alternatives are to leave active duty after com- 
pleting their obligated period or to apply for additional time on active 
duty as a reserve, subject to approval of the Commandant. 

The fact that regular officers have the freedom to choose when they 
will leave active duty raises the question of how can one predict the 
number of officers that will be on active duty some time in the future. 
This problem of predicting future group sizes of regular officers is the 
motivation for this study. 

B * PURPOSE 

* 

This study was undertaken to investigate attrition and retention 
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rates of different year-groups (henceforth called cohorts) of regular 
Marine Corps officers. Cohorts studied consisted of all officers re- 
ceiving commissions as regular Marine Corps Second Lieutenants during 
a calendar year. 

Data was taken for three cohorts, 1956 - 58. After computing the 
retention rates realized from these groups, the stationarity properties 
were investigated. 

Let p.(jk) = P [Member of cohort 19 jk is on active duty i years 
after commissioning]. 

If cohorts behave essentially the same for different year groups, then 
P^(jk) = p^ for all jk. That is, the p^ are stationary over time and 
independent of the cohort initial size. This hypothesis is tested in 
Section III B. Results of the test lead to the formulation of a pre- 
diction model, and predictions for year groups 1959 - 63 are compared 
with actual figures in Section III D. 

C. RESULTS 

Analysis of cohorts 1956 - 58 indicate that attrition from these 
groups can be considered stationary at least through 1968, the last year 
that data was available at the time of the study. A prediction model 
was formulated based on the overall yearly retention realized from the 
total membership of these three cohorts. For example, p^ is computed by 
dividing the total number of officers from cohorts 1956 - 58 still on 
active duty i years after commissioning by the total initial size of the 
three groups. Using these p^ predictions are made of the number remain- 
ing on active duty in various years from cohorts 1959 - 63. These pre- 
dictions agree well with actual figures for these cohorts. 
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This study does not attempt to predict retention by rank or promotion 
frequency, nor is the reason for officers leaving active duty considered. 

Section IV contains a discussion of the similarities and differences 
between the cohort model and Markov Chain Models which have been discus- 
sed widely in the literature. 

In Section V A, it is suggested that the methodology used to develop 
the model can be employed to develop similar models for other cohorts 
such as reserve officers, aviators, etc. After developing such models, 
a cost model could be developed to compare the procurement costs per ex- 
pected year of service for various cohorts. 

It is suggested in Section V B that an application of the model 
might be to assist personnel planners in determining the number of re- 
serve officers from a given year group to be given regular commissions. 

An example is given in Section V B which illustrates how this model, 
in conjunction with a similar one for reserve officers, might be used to 

assist in determining and controlling future numbers of Marine Corps 

* 

officers . 
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II. DATA COLLECTION 



A. COLLECTION PROCEDURES 

Cohort size for a given year was determined from applicable sections 
of the following year's Lineal List [Ref. 1]. For example, the group of 
1956 consists of all officers receiving regular Second Lieutenant com- 
missions during the calendar year 1956, according to the 1957 Lineal 
List. Subsequent cohort size was established by counting those officers 
of an initial group that were still on active duty according to the ap- 
propriate Lineal List. An officer was considered only to be on active 
duty or not. No attempt was made to differentiate between different 
reasons for leaving, and no attempt was made to consider promotions or 
occupational specialty. 

B . DATA BASE 

Table I below lists the number and percentage of each cohort on 
active duty in succeeding calendar years after their initial commission. 
Two graphs of this data are given. Each is plotted using semi-logari- 
thmic scale. In Graph I the plot shows the percentage remaining in re- 
lation to the year after commissioning. In Graph II the percentage 
remaining is plotted against a common origin of clock time starting in 
year 1956. A straight line on these graphs would result from attrition 
at a constant rate. Concave and convex segments of the plots indicate 
increasing and decreasing attrition rates, respectively. 

Some important characteristics of the data are quite apparent in 
these graphs. There is an initial period of nearly 100A retention which 
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TABLE I 



Data For Regular Marine Corps Officer Cohorts 

1956 - 58 



fear After 
Commission 


1956 Cohort 
On Active Duty 
Number 7 0 


1957 Cohort 
On Active Duty 
Number % 


1958 Cohort 
On Active Duty 
Number % 


0 


414 


100 


339 


100 


369 


100 


1 


410 


99 


337 


99 


367 


99 


2 


408 


99 


334 


99 


362 


98 


3 


256 


62 


218 


64 


257 


70 


4 


206 


50 


190 


56 


211 


57 


5 


182 


44 


168 


50 


196 


53 


6 


176 


43 


156 


46 


187 


51 


7 


171 


41 


150 


44 


178 


48 


8 


168 


41 


144 


42 


168 


46 


9 


165 


40 


141 


42 


157 


43 


10 


156 


38 


133 


39 


137 


37 


11 


153 


37 


120 


35 


NA 


NA 


12 


143 


35 


NA 


NA 


NA 


NA 



is due to the obligation of each officer to serve a minimum of three 

years on active duty after commissioning. Following this obligation 

period both graphs show essentially similar slopes for each cohort, a 

generally decreasing attrition rate. Graph II shows a tendency toward 

an increased attrition rate for all groups in 1966 and 1967. It is not 

the purpose of this paper to analyze or explain these characteristics, 

and no attempt is made to do so. The graphs do however suggest a model, 

in that they appear to be plots of three realizations of the same, but 

unknown, random process. Formulation of the model and hypothesis testing 

* 

are discussed in the next section. 
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PERCENTAGE REMAINING ON ACTIVE DUTY 




YEAR AFTER COMMISSIONING 



GRAPH I. PERCENTAGE REMAINING ON 
ACTIVE DUTY RELATIVE TO 
YEAR AFTER COMMISSIONING. 
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GRAPH n PERCENTAGE REMAINING ON ACTIVE DUTY 
BY CALENDAR YEAR 



u 



III. THE COHORT MODEL 



A. ASSUMPTIONS 



The following basic assumptions are made in the formulation of the 
officer retention prediction model. It is assumed that all individuals 
act independently of each other with regards to leaving or staying on 
active duty. Each officer can be considered to undergo a random walk 
where p^ is the probability he stays in the system i years after en- 
trance, independent of when he joined and the number in the group in 
which he joined. 



Recall that p.(jk) = P[Member of cohort 19jk is on active 

duty i years after commissioning]. 

Let X (jk) ~ Number of officers commissioned in calendar 
year 19jk, 

and X. (jk) = Number on active duty i years after comis- 
sioning in year 19jk. 

Then, under the above assumptions, P^(jk) = p^, and X^(jk) given X^(jk), 
has the following binomial distribution; 



P[X i (ik) - n|X o (jk)] 



x 0 (.)k) 

n 

n = 




0 , 1 , 2 , 



d-p.) 



X (jk)-n 
o 



• • • jk) • 
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The theory that data from the 1956 - 58 cohorts are realizations 
of this stochastic process must now be tested. Since X^(jk) is large 
(approximately 400) we can use the well known homogenie ty test (see, 
for example, Guenther [Ref. 2]). This test establishes whether we can 
accept our theory based on the comparison of expected and observed cohort 
sizes from year to year. The test and its results are discussed below. 
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B 0 HYPOTHESIS TESTING 

To verify the foregoing assumption of stationarity , a hypothesis 
of homogeniety, as found in Guenther [Ref. 2], is tested for cohorts 
1956 - 58. 

Let r t (jk) - The expected fraction of X Q (jk) on active duty 
i years after commissioning. 

Now test the hypothesis: 



H q : r i (jk) = i = 3,4, ...9; jk = 56,57,58. 

r\ ( jk) f r i> 

That is, the fraction remaining on active duty for a given number of 
years after commissioning is tested to determine if it is independent 
of the cohort. 

The homogeniety test compares realizations with expected values, 
and can best be illustrated with a contingency table, Table II, where 
the are observed cohort sizes a given year after commissioning. 



TABLE II 



Sample Contingency Table 





Cohorts 

1956 1957 1958 


Totals 


On 

Active 

Duty 


°11 


°12 


°13 


_ . .j _ . 


Not on 
Active 
Duty 


°21 

j 

1 


°22 


°23 




i 

Totals 


1 °n 

i 


CM 

•H 

o 

H 


i 


i j 



13 



E. . 
iJ 
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Using the elements of the contingency table, expected cohort sizes, 
are computed as follows: 



E 



11 




5 5 



0 

ij 



E 



12 



Q o,j 

J 



) Q o. 2 ) 



5 




E 



23 




I I °ij 



The test value is then computed by taking 



i 

This result is compared to the appropriate value from the chi-square 
table. If the computed value is less than, or equal to, the tabled 
value, the hypothesis is accepted. 

An example of the computations done in this analysis is shown below. 
Observed data for the third year after commissioning is listed in Table 
III. Expected values for the third year after commissioning were com- 
puted as follows: 



(0. . - E. .) 
^ i.J 

E. . 
ij 
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TABLE III 



Observed Values for the Third Year After Commissioning 



Cohort 

1956 1957 1958 Totals 


On 

Active 

Duty 


256 


218 


257 


731 


Not On 
Active 
Duty 


158 


121 


112 


391 


Totals 


414 


339 


369 


1122 







414 x 731 




270 


11 




1122 








339 x 731 




221 


12 




1122 








369 x 731 




240 


13 




1122 




21 




414 x 391 
1122 


= 


144 






339 x 391 




118 


22 




1122 








369 x 391 




129 


23 




1122 





The test value is computed as: 

Test = ( 257 - 270 ) 2 (2 18 - 221 ) 2 (257 - 240) 2 

Value 270 221 240 



( 158 - 1 44) 2 + (12 1 - 118 ) 2 + ( 112 - 129 ) 2 _, , ,, 

144 118 129 

The value from the chi-square table, at a 5 % level, with two degrees of 
freedom is 5.99. Therefore the hypothesis of homogeniety for year three 
is accepted. Similar computations were done for years four through nine 
and the results are shown in Table IV. Analysis indicates that for year 
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nine the hypothesis should be rejected, however the overall hypothesis 
of homogeniety can not be rejected by this analysis. 



TABLE IV 

Homogeniety Test Results 



Year 


Test Value 


Tabled Value 


3 


4.56 


5.99 


4 


4.12 


5.99 


5 


3.59 


5.99 


6 


3.26 


5.99 


7 


1.18 


5.99 


8 


4.11 


5.99 


9 


6.71 


5.99 



C. PARAMETER ESTIMATION 



The parameters, p^, for the model are estimated using the average 
outcome of cohorts 1956 - 58. For a given year, say year i, after 
commissioning p^ is determined as follows: 

X, (56) + X. (57) + X. (58) 

P i = X (56) + X (57) + X (58) * 
o o v o 

Using this procedure, the p^ for years three through nine were computed 
and are listed in Table V. 

TABLE V 

Model Parameters, p. 

l 



Year 


P i 


3 


.65 


4 


.54 


5 


.49 


6 


.47 


7 


. 45 


8 


.43 


9 


.41 
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These probabilities are used to compute expected sizes and confidence 
limits for other cohorts » For example, the expected number remaining in 
cohort 1961, i years after commissioning is: 

E, (61) = [X (61)] p , i = 3,4, ... ,9. 

1 O 1 

The confidence interval (Cl) for this predicted value is computed as 
follows : 

CI. (61) = [E. (61) - 2 fVar (61) , E. (61) + 2 (Var. (61)], 

11 £ 1 1 

where Var. (61) = [X (61)] p B (l-p 0 ) , i = 3,4,. ..,9. 

i o 11 

Since we have binomial distributions with large n, using the Central 
Limit Theorem, we can assume normality. Therefore the use of two 
standard deviations gives a 95% confidence interval. 

Do PREDICTION RESULTS AND COMPARISON WITH REAL DATA 



Using the procedures described in Section II, selected data was 
collected on cohorts 1959-63. In Tables VI and VII, this data is com- 
pared with predictions made using the model with the parameters in 
Table V. Each of the actual cohort sizes was within the confidence in- 
terval with but one exception, year four of cohort I960. 



TABLE VI 

Actual Numbers, Predicted Numbers and Confidence Intervals 

For Cohort I960 



Year After 
Commissioning 


Actual 

Numbers 


Predicted 

Numbers 


95% Confidence 
Intervals 


3 


240 


229 


(211,247) 


4 


212 


191 


(173,209) 


5 


190 


173 


(155,191) 


6 


172 


166 


(148,184) 


7 


161 


159 


(141,177) 


8 


144 


152 


(134,170) 



17 



TABLE VII 



Actual Numbers, Predicted Numbers and Confidence Intervals 
For Selected Cohorts on Active Duty in 1968. 



Cohort 


Actual 

Numbers 


Predicted 

Numbers 


95% Confidence 
Intervals 


1959 


139 


147 


(129,165) 


1960 


144 


152 


(134,170) 


1961 


141 


141 


(123,159) 


1962 


173 


163 


(145,181) 


1963 


138 


141 


(125,157) 
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IV . MAKKOVIW _ PROPERTIES OF THh COHORT MODEL 

A significant num' er of recent models of social behavior use Markov 
chain theory* Several jch riodels are discussed by Bartholomew and 
Thonstad [Refs. 3 end A]. Because the use of Markov chain theory is so 
widespread in the literature, it is appropriate to point out the simi- 
larities and differences v etween the cohort model developed in this 
study and the Mai kev Chain Models. 

Let Y(jk; be the expected total number, from all cohorts 
considered, on active duty in year 19jko 

Using the notation shown in Section III, 

Y(jk) = X o (jk) 4 X o (jk-l) Pl + X o (jk-2)p 2 4 . . . 

Now let C(]k) = The fraction of Y(jk) who remain on active duty 
in year 19 i k 4 1 . 



So C ( jk) = 



Y(jk + 1) - X (jk 4 1) 

— 



X (jk)p 4 X (jk~Dp 4 X (jk-2)p 4 . . 

O 1 o L Q J 

X Q (jk) ■*- X o (jk-l) Pl 4 X o (jk-2)p 2 4 . 



( 1 ) 



C(jk) can be interpreted as the probability of remaining on active duty 
in year (jk-hl) given yc u are on active duty in year (jk). Clearly this 
is a function of all previous cohort sizes and in general is a function 
of (jk). 

Now assume a constant attrition rate, or equivalently that the 
curves in Graphs I and II are straight lines. Then define: 



p - ^[Peric.-. is on active duty in year t | on active 
duty in year t~l. ] . 



( 2 ) 



Clearly p = p for all t. 
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Now equation (1) becomes 

X (jk)p + X (jk-l)p 2 + X (jk-2)p 3 + . . . 

C(jk) = o_ o 

X o (jk) + X o (.1k-l)p + X o (.ik-2)p 2 + . . . 

Multiplying top and bottom by p, and cancelling, results in C(t) = p 
for all t; that is, with a constant attrition rate the expected fraction 
remaining on active duty from year to year is constant. In this case 
attrition from a cross-section depends only on the size of the cross- 
section and not on the fraction of each cohort in the cross-section. 

Thus the Markov property holds and the cohort model is equivalent to a 
Markov Model, where we define: 

State 1: On active duty. 

State 2: Not on active duty. 

The transition probability matrix is 



1 



P * 



2 



1 2 
p i-p 

0 1 



, where p is defined in (2). 



It is evident from Graphs I and II that the attrition rate is not con- 
stant for Marine Corps regular officers. 

Let us assume that, rather than a constant attrition rate, all 
cohorts are of the same initial size, that is X Q (jk) = X for all jk. 
Now equation (1) becomes: 



C(jk) 



x ( p l + P 2 + p 3 + 



X (1 + p x + p 2 + 



- id_ 



1+ 5 

j=i 



p j 





C(]k) in this case is again a constant, independent of time. In general 
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the overall attrition rate from a given cross-section does depend on 



the fraction of the cross-section which comes from a cohort of a given 
age. But if all cohorts have the same initial size the fraction of the 
cohort of age t, say, is the same in every cross-section. This leads 
to great stability in the overall attrition rates in succeeding years. 

We conclude that if cohort sizes remain nearly constant from year 
to year, then a Markov-type model will probably give good predictions. 
However, since initial cohort sizes in future years are to be thought 
of as control variables, the cohort model should describe attrition 
phenomena more accurately than the Markov-type Model. Indeed the 
figures in Table I indicate that in the past cohort sizes have differed 
s igni f icant ly . 
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V. CONCLUSIONS 



The conclusion reached from this study is that retention rates from 
different cohorts can be considered stationary over reasonable time 
periods independent of time and of group size. The groups looked at 
were commissioned up to seven years apart. Engagement in the Viet Nam 
conflict occurred at different times relative to each cohort f s entry into 
the Marine Corps. Years between promotions varied somewhat and even 
obligated service was not of the same duration for all groups. However 
with all these effects taking place, behavior was essentially stationary 
and the model provides statistically valid predictions for all cohorts 
considered . 

A. FURTHER STUDY 

The methodology developed in this study can be applied to other 
groups of interest. Possible cohorts might be: aviators. Naval Academy 
graduates, officers that were previously enlisted Marines, and regular 
officers that were initially commissioned as reserves. 

A natural follow-on study would be to develop a cost model for 
cohorts of interest. By using the average cost of obtaining an officer 
of a particular group and his expected length of service, as computed 
in his cohort analysis, an average cost per expected year of service 
could be compared for different cohorts to assist in determining the 
optimal means of officer procurement and retention. 



22 



B. USE OF THE STUDY 



The ability to predict future sizes of incoming officer cohorts un- 
doubtedly has several immediate applications in personnel planning. One 
might be to assist in determining the number of reserve officers from a 
given year group to receive regular commissions. Using this model the 
planner can estimate the number of regular officers that will be on 
active duty at some time in the future, comparison of this figure with 
anticipated requirements will give an indication of how many reserves 
will be needed to meet needs. 

Here is a simple example of how this model and methodology can be 
used to assist in officer planning. Assume that as suggested in Section 
IV A above, a similar model is developed for all other officer cohorts, 
i.e,, for those not initially commissioned as regular officers, and that 
the new model has stationarity characteristics similar to the model de- 
veloped in this study. 



Let: p.* = PfOfficer receiving a reserve commission is on active 

duty i years after commission. ] 

X ' ( jk) = Number of officers entering active duty with 
reserve commissions during calendar year jk. 

X ' ( jk) = Number of these on active duty i years after 
commission. 



Now by combining the two models, we can look at expected retention 
over a period of time, say five years for this example. This gives us 



the following: 



Expected number of 
officers , commissioned 
during 1970-74, on 
active duty in 1979. 



- X o (70)p 9 + X o '(70)p 9 ' + 

X o (7lK, -I- X o '(71)p s - 

• • • + V M >-5 1 o' ( 7 '*» p s' - 
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With this function we can address several different questions concerning 
officer retention. Three possible areas of interest are: 

1. If the X^( jk) and X^'Qk) are fixed, we can predict how 
many officers from these cohorts will be on active duty in 1979. 

2. If some X (jk) and X 1 ( jk) are fixed and some variable, 

o o 

to meet a desired total from these cohorts in 1979, we can determine 
optimal sizes for the variable inputs. 

3. If all X^(jk) and ^'(jk) are fixed and we have upper 
and lower limits on the total number that we want in 1979, then steps 
can be taken to increase or decrease attrition rates as appropriate. 
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